Dataset statistics
| Number of variables | 10 |
|---|---|
| Number of observations | 2007 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 156.9 KiB |
| Average record size in memory | 80.1 B |
Variable types
| Numeric | 9 |
|---|---|
| Categorical | 1 |
Solids has unique values | Unique |
Reproduction
| Analysis started | 2023-08-17 19:08:16.554293 |
|---|---|
| Analysis finished | 2023-08-17 19:08:23.575040 |
| Duration | 7.02 seconds |
| Software version | pandas-profiling v3.6.6 |
| Download configuration | config.json |
ph
Real number (ℝ)
| Distinct | 627 |
|---|---|
| Distinct (%) | 31.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.0885102 |
| Minimum | 0.23 |
|---|---|
| Maximum | 14 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | 0.23 |
|---|---|
| 5-th percentile | 4.623 |
| Q1 | 6.09 |
| median | 7.03 |
| Q3 | 8.05 |
| 95-th percentile | 9.797 |
| Maximum | 14 |
| Range | 13.77 |
| Interquartile range (IQR) | 1.96 |
Descriptive statistics
| Standard deviation | 1.5724527 |
|---|---|
| Coefficient of variation (CV) | 0.2218312 |
| Kurtosis | 0.62526285 |
| Mean | 7.0885102 |
| Median Absolute Deviation (MAD) | 0.99 |
| Skewness | 0.050625241 |
| Sum | 14226.64 |
| Variance | 2.4726075 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 6.51 | 11 | 0.5% |
| 7.61 | 11 | 0.5% |
| 7.37 | 11 | 0.5% |
| 6.58 | 10 | 0.5% |
| 6.98 | 10 | 0.5% |
| 6.79 | 10 | 0.5% |
| 7.29 | 10 | 0.5% |
| 6.92 | 10 | 0.5% |
| 6.85 | 10 | 0.5% |
| 6.34 | 10 | 0.5% |
| Other values (617) | 1904 |
| Value | Count | Frequency (%) |
| 0.23 | 1 | |
| 0.99 | 1 | |
| 1.43 | 1 | |
| 1.76 | 1 | |
| 1.99 | 1 | |
| 2.13 | 1 | |
| 2.38 | 1 | |
| 2.54 | 1 | |
| 2.56 | 1 | |
| 2.57 | 1 |
| Value | Count | Frequency (%) |
| 14 | 1 | |
| 13.35 | 1 | |
| 12.25 | 1 | |
| 11.9 | 1 | |
| 11.57 | 1 | |
| 11.56 | 1 | |
| 11.53 | 1 | |
| 11.5 | 2 | |
| 11.49 | 1 | |
| 11.45 | 1 |
Hardness
Real number (ℝ)
| Distinct | 1819 |
|---|---|
| Distinct (%) | 90.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 195.96 |
| Minimum | 73.49 |
|---|---|
| Maximum | 317.34 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | 73.49 |
|---|---|
| 5-th percentile | 141.238 |
| Q1 | 176.735 |
| median | 197.19 |
| Q3 | 216.465 |
| 95-th percentile | 248.851 |
| Maximum | 317.34 |
| Range | 243.85 |
| Interquartile range (IQR) | 39.73 |
Descriptive statistics
| Standard deviation | 32.662271 |
|---|---|
| Coefficient of variation (CV) | 0.16667826 |
| Kurtosis | 0.52235747 |
| Mean | 195.96 |
| Median Absolute Deviation (MAD) | 19.94 |
| Skewness | -0.084561181 |
| Sum | 393291.71 |
| Variance | 1066.8239 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 208.91 | 4 | 0.2% |
| 169.4 | 3 | 0.1% |
| 166.64 | 3 | 0.1% |
| 205.21 | 3 | 0.1% |
| 203.4 | 3 | 0.1% |
| 203.07 | 3 | 0.1% |
| 185.34 | 3 | 0.1% |
| 200.71 | 3 | 0.1% |
| 227.23 | 3 | 0.1% |
| 234.78 | 3 | 0.1% |
| Other values (1809) | 1976 |
| Value | Count | Frequency (%) |
| 73.49 | 1 | |
| 77.46 | 1 | |
| 81.71 | 1 | |
| 94.09 | 1 | |
| 94.81 | 1 | |
| 94.91 | 1 | |
| 97.28 | 1 | |
| 98.45 | 1 | |
| 98.77 | 1 | |
| 100.46 | 1 |
| Value | Count | Frequency (%) |
| 317.34 | 1 | |
| 306.63 | 1 | |
| 300.29 | 1 | |
| 287.98 | 1 | |
| 286.57 | 1 | |
| 284 | 1 | |
| 283.9 | 1 | |
| 282.74 | 1 | |
| 281.59 | 1 | |
| 280.09 | 1 |
Solids
Real number (ℝ)
| Distinct | 2007 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 21918.672 |
| Minimum | 320.94 |
|---|---|
| Maximum | 56488.67 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | 320.94 |
|---|---|
| 5-th percentile | 9550.156 |
| Q1 | 15615.665 |
| median | 20933.51 |
| Q3 | 27195.355 |
| 95-th percentile | 38304.702 |
| Maximum | 56488.67 |
| Range | 56167.73 |
| Interquartile range (IQR) | 11579.69 |
Descriptive statistics
| Standard deviation | 8648.7693 |
|---|---|
| Coefficient of variation (CV) | 0.39458454 |
| Kurtosis | 0.34105125 |
| Mean | 21918.672 |
| Median Absolute Deviation (MAD) | 5760.25 |
| Skewness | 0.59549119 |
| Sum | 43990775 |
| Variance | 74801210 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 19451.77 | 1 | < 0.1% |
| 19598.86 | 1 | < 0.1% |
| 14464.12 | 1 | < 0.1% |
| 20229.11 | 1 | < 0.1% |
| 27776.9 | 1 | < 0.1% |
| 12145.54 | 1 | < 0.1% |
| 16233.13 | 1 | < 0.1% |
| 16559.88 | 1 | < 0.1% |
| 11619.71 | 1 | < 0.1% |
| 25419.77 | 1 | < 0.1% |
| Other values (1997) | 1997 |
| Value | Count | Frequency (%) |
| 320.94 | 1 | |
| 1198.94 | 1 | |
| 1351.91 | 1 | |
| 1372.09 | 1 | |
| 2552.96 | 1 | |
| 3413.08 | 1 | |
| 3640.73 | 1 | |
| 4111.79 | 1 | |
| 4168.2 | 1 | |
| 4304.49 | 1 |
| Value | Count | Frequency (%) |
| 56488.67 | 1 | |
| 56351.4 | 1 | |
| 55334.7 | 1 | |
| 53735.9 | 1 | |
| 50793.9 | 1 | |
| 50279.26 | 1 | |
| 49074.73 | 1 | |
| 49009.92 | 1 | |
| 48621.56 | 1 | |
| 48204.17 | 1 |
Chloramines
Real number (ℝ)
| Distinct | 638 |
|---|---|
| Distinct (%) | 31.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.1375386 |
| Minimum | 1.39 |
|---|---|
| Maximum | 13.13 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | 1.39 |
|---|---|
| 5-th percentile | 4.553 |
| Q1 | 6.15 |
| median | 7.15 |
| Q3 | 8.11 |
| 95-th percentile | 9.757 |
| Maximum | 13.13 |
| Range | 11.74 |
| Interquartile range (IQR) | 1.96 |
Descriptive statistics
| Standard deviation | 1.5844607 |
|---|---|
| Coefficient of variation (CV) | 0.22198978 |
| Kurtosis | 0.55684227 |
| Mean | 7.1375386 |
| Median Absolute Deviation (MAD) | 0.98 |
| Skewness | 0.010101841 |
| Sum | 14325.04 |
| Variance | 2.5105156 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 6.61 | 11 | 0.5% |
| 7.49 | 11 | 0.5% |
| 7.63 | 11 | 0.5% |
| 7.84 | 11 | 0.5% |
| 7.3 | 10 | 0.5% |
| 7.69 | 10 | 0.5% |
| 7.4 | 10 | 0.5% |
| 7.57 | 10 | 0.5% |
| 6.19 | 10 | 0.5% |
| 7.66 | 10 | 0.5% |
| Other values (628) | 1903 |
| Value | Count | Frequency (%) |
| 1.39 | 1 | |
| 1.92 | 1 | |
| 2.4 | 1 | |
| 2.46 | 2 | |
| 2.48 | 1 | |
| 2.5 | 1 | |
| 2.62 | 1 | |
| 2.65 | 2 | |
| 2.73 | 1 | |
| 2.74 | 1 |
| Value | Count | Frequency (%) |
| 13.13 | 1 | |
| 13.04 | 1 | |
| 12.65 | 1 | |
| 12.63 | 1 | |
| 12.58 | 1 | |
| 12.25 | 1 | |
| 12.23 | 1 | |
| 12.06 | 1 | |
| 11.99 | 1 | |
| 11.93 | 1 |
Sulfate
Real number (ℝ)
| Distinct | 1869 |
|---|---|
| Distinct (%) | 93.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 333.24903 |
| Minimum | 129 |
|---|---|
| Maximum | 481.03 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | 129 |
|---|---|
| 5-th percentile | 267.697 |
| Q1 | 307.63 |
| median | 332.23 |
| Q3 | 359.4 |
| 95-th percentile | 401.599 |
| Maximum | 481.03 |
| Range | 352.03 |
| Interquartile range (IQR) | 51.77 |
Descriptive statistics
| Standard deviation | 41.232106 |
|---|---|
| Coefficient of variation (CV) | 0.12372761 |
| Kurtosis | 0.78422655 |
| Mean | 333.24903 |
| Median Absolute Deviation (MAD) | 25.73 |
| Skewness | -0.047571658 |
| Sum | 668830.81 |
| Variance | 1700.0866 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 319.25 | 3 | 0.1% |
| 320.26 | 3 | 0.1% |
| 343.29 | 3 | 0.1% |
| 339.06 | 3 | 0.1% |
| 340.98 | 3 | 0.1% |
| 318.79 | 3 | 0.1% |
| 367.33 | 2 | 0.1% |
| 322.1 | 2 | 0.1% |
| 338.58 | 2 | 0.1% |
| 338.05 | 2 | 0.1% |
| Other values (1859) | 1981 |
| Value | Count | Frequency (%) |
| 129 | 1 | |
| 180.21 | 1 | |
| 182.4 | 1 | |
| 187.17 | 1 | |
| 187.42 | 1 | |
| 192.03 | 1 | |
| 203.44 | 1 | |
| 205.94 | 1 | |
| 206.25 | 1 | |
| 207.89 | 1 |
| Value | Count | Frequency (%) |
| 481.03 | 1 | |
| 476.54 | 1 | |
| 475.74 | 1 | |
| 460.11 | 1 | |
| 458.44 | 1 | |
| 450.91 | 1 | |
| 447.42 | 1 | |
| 446.72 | 1 | |
| 445.94 | 1 | |
| 445.36 | 1 |
Conductivity
Real number (ℝ)
| Distinct | 1934 |
|---|---|
| Distinct (%) | 96.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 426.52422 |
| Minimum | 201.62 |
|---|---|
| Maximum | 753.34 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | 201.62 |
|---|---|
| 5-th percentile | 300.452 |
| Q1 | 366.68 |
| median | 423.42 |
| Q3 | 482.525 |
| 95-th percentile | 564.748 |
| Maximum | 753.34 |
| Range | 551.72 |
| Interquartile range (IQR) | 115.845 |
Descriptive statistics
| Standard deviation | 80.761753 |
|---|---|
| Coefficient of variation (CV) | 0.18934857 |
| Kurtosis | -0.24136586 |
| Mean | 426.52422 |
| Median Absolute Deviation (MAD) | 57.94 |
| Skewness | 0.26746326 |
| Sum | 856034.11 |
| Variance | 6522.4607 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 412.71 | 3 | 0.1% |
| 517.43 | 3 | 0.1% |
| 402.66 | 3 | 0.1% |
| 404.2 | 2 | 0.1% |
| 494.15 | 2 | 0.1% |
| 468.37 | 2 | 0.1% |
| 415.01 | 2 | 0.1% |
| 392.7 | 2 | 0.1% |
| 411.3 | 2 | 0.1% |
| 351.48 | 2 | 0.1% |
| Other values (1924) | 1984 |
| Value | Count | Frequency (%) |
| 201.62 | 1 | |
| 210.32 | 1 | |
| 233.91 | 1 | |
| 245.86 | 1 | |
| 252.97 | 1 | |
| 254.39 | 2 | |
| 257.01 | 1 | |
| 257.7 | 1 | |
| 258.88 | 1 | |
| 259.96 | 1 |
| Value | Count | Frequency (%) |
| 753.34 | 1 | |
| 708.23 | 1 | |
| 695.37 | 1 | |
| 669.73 | 1 | |
| 666.69 | 1 | |
| 657.57 | 1 | |
| 656.92 | 1 | |
| 652.54 | 1 | |
| 649.81 | 1 | |
| 646.73 | 1 |
Organic_carbon
Real number (ℝ)
| Distinct | 1028 |
|---|---|
| Distinct (%) | 51.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.366153 |
| Minimum | 2.2 |
|---|---|
| Maximum | 27.01 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | 2.2 |
|---|---|
| 5-th percentile | 8.953 |
| Q1 | 12.13 |
| median | 14.33 |
| Q3 | 16.69 |
| 95-th percentile | 19.64 |
| Maximum | 27.01 |
| Range | 24.81 |
| Interquartile range (IQR) | 4.56 |
Descriptive statistics
| Standard deviation | 3.3219741 |
|---|---|
| Coefficient of variation (CV) | 0.23123616 |
| Kurtosis | 0.039087648 |
| Mean | 14.366153 |
| Median Absolute Deviation (MAD) | 2.26 |
| Skewness | -0.021355161 |
| Sum | 28832.87 |
| Variance | 11.035512 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 14.35 | 8 | 0.4% |
| 13.79 | 7 | 0.3% |
| 12.07 | 6 | 0.3% |
| 16.14 | 6 | 0.3% |
| 14.25 | 6 | 0.3% |
| 15.67 | 6 | 0.3% |
| 12.49 | 6 | 0.3% |
| 18.02 | 5 | 0.2% |
| 14.98 | 5 | 0.2% |
| 11.67 | 5 | 0.2% |
| Other values (1018) | 1947 |
| Value | Count | Frequency (%) |
| 2.2 | 1 | |
| 4.37 | 1 | |
| 4.47 | 1 | |
| 4.86 | 1 | |
| 4.97 | 1 | |
| 5.16 | 1 | |
| 5.19 | 1 | |
| 5.2 | 1 | |
| 5.22 | 1 | |
| 5.32 | 1 |
| Value | Count | Frequency (%) |
| 27.01 | 1 | |
| 24.76 | 1 | |
| 23.92 | 1 | |
| 23.6 | 1 | |
| 23.57 | 1 | |
| 23.4 | 1 | |
| 23.37 | 1 | |
| 23.32 | 1 | |
| 23.23 | 1 | |
| 23.14 | 1 |
Trihalomethanes
Real number (ℝ)
| Distinct | 1686 |
|---|---|
| Distinct (%) | 84.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 66.405057 |
| Minimum | 8.58 |
|---|---|
| Maximum | 124 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | 8.58 |
|---|---|
| 5-th percentile | 39.585 |
| Q1 | 55.955 |
| median | 66.54 |
| Q3 | 77.31 |
| 95-th percentile | 91.659 |
| Maximum | 124 |
| Range | 115.42 |
| Interquartile range (IQR) | 21.355 |
Descriptive statistics
| Standard deviation | 16.08709 |
|---|---|
| Coefficient of variation (CV) | 0.242257 |
| Kurtosis | 0.22234742 |
| Mean | 66.405057 |
| Median Absolute Deviation (MAD) | 10.66 |
| Skewness | -0.051624408 |
| Sum | 133274.95 |
| Variance | 258.79447 |
| Monotonicity | Increasing |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 68.91 | 4 | 0.2% |
| 55.4 | 4 | 0.2% |
| 63.7 | 4 | 0.2% |
| 42.29 | 3 | 0.1% |
| 73.19 | 3 | 0.1% |
| 81.39 | 3 | 0.1% |
| 52.42 | 3 | 0.1% |
| 47.1 | 3 | 0.1% |
| 62.32 | 3 | 0.1% |
| 66.69 | 3 | 0.1% |
| Other values (1676) | 1974 |
| Value | Count | Frequency (%) |
| 8.58 | 1 | |
| 14.34 | 1 | |
| 15.68 | 1 | |
| 16.29 | 1 | |
| 17.53 | 1 | |
| 17.92 | 1 | |
| 18.02 | 1 | |
| 19.18 | 1 | |
| 22.22 | 1 | |
| 23.14 | 1 |
| Value | Count | Frequency (%) |
| 124 | 1 | |
| 120.03 | 1 | |
| 116.16 | 1 | |
| 114.21 | 1 | |
| 114.03 | 1 | |
| 113.05 | 1 | |
| 112.62 | 1 | |
| 111.6 | 1 | |
| 111.12 | 1 | |
| 110.43 | 1 |
Turbidity
Real number (ℝ)
| Distinct | 362 |
|---|---|
| Distinct (%) | 18.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.9700897 |
| Minimum | 1.45 |
|---|---|
| Maximum | 6.49 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.8 KiB |
Quantile statistics
| Minimum | 1.45 |
|---|---|
| 5-th percentile | 2.69 |
| Q1 | 3.44 |
| median | 3.97 |
| Q3 | 4.515 |
| 95-th percentile | 5.2 |
| Maximum | 6.49 |
| Range | 5.04 |
| Interquartile range (IQR) | 1.075 |
Descriptive statistics
| Standard deviation | 0.77995117 |
|---|---|
| Coefficient of variation (CV) | 0.19645681 |
| Kurtosis | -0.045387937 |
| Mean | 3.9700897 |
| Median Absolute Deviation (MAD) | 0.53 |
| Skewness | -0.033571951 |
| Sum | 7967.97 |
| Variance | 0.60832382 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 3.92 | 20 | 1.0% |
| 3.63 | 17 | 0.8% |
| 4.18 | 17 | 0.8% |
| 4.37 | 16 | 0.8% |
| 4.24 | 16 | 0.8% |
| 4.59 | 15 | 0.7% |
| 3.42 | 15 | 0.7% |
| 3.7 | 15 | 0.7% |
| 3.08 | 14 | 0.7% |
| 4.1 | 14 | 0.7% |
| Other values (352) | 1848 |
| Value | Count | Frequency (%) |
| 1.45 | 1 | |
| 1.49 | 1 | |
| 1.5 | 1 | |
| 1.68 | 1 | |
| 1.81 | 1 | |
| 1.84 | 1 | |
| 1.87 | 1 | |
| 1.91 | 1 | |
| 1.92 | 1 | |
| 1.96 | 2 |
| Value | Count | Frequency (%) |
| 6.49 | 2 | |
| 6.39 | 1 | |
| 6.36 | 1 | |
| 6.31 | 1 | |
| 6.23 | 1 | |
| 6.08 | 1 | |
| 6.06 | 1 | |
| 6.03 | 1 | |
| 5.99 | 2 | |
| 5.96 | 1 |
Potability
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.8 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2007 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1196 | |
| 1 | 811 |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 1196 | |
| 1 | 811 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1196 | |
| 1 | 811 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2007 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1196 | |
| 1 | 811 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2007 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1196 | |
| 1 | 811 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2007 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1196 | |
| 1 | 811 |
| ph | Hardness | Solids | Chloramines | Sulfate | Conductivity | Organic_carbon | Trihalomethanes | Turbidity | Potability | |
|---|---|---|---|---|---|---|---|---|---|---|
| ph | 1.000 | 0.138 | -0.080 | -0.040 | 0.015 | 0.010 | 0.024 | 0.018 | -0.047 | 0.097 |
| Hardness | 0.138 | 1.000 | -0.050 | -0.020 | -0.097 | 0.002 | 0.012 | -0.022 | -0.029 | 0.074 |
| Solids | -0.080 | -0.050 | 1.000 | -0.041 | -0.143 | 0.005 | 0.004 | -0.022 | 0.026 | 0.046 |
| Chloramines | -0.040 | -0.020 | -0.041 | 1.000 | 0.023 | -0.021 | -0.023 | 0.015 | 0.000 | 0.076 |
| Sulfate | 0.015 | -0.097 | -0.143 | 0.023 | 1.000 | -0.020 | 0.016 | -0.027 | -0.013 | 0.148 |
| Conductivity | 0.010 | 0.002 | 0.005 | -0.021 | -0.020 | 1.000 | 0.017 | -0.006 | 0.022 | 0.000 |
| Organic_carbon | 0.024 | 0.012 | 0.004 | -0.023 | 0.016 | 0.017 | 1.000 | -0.004 | -0.011 | 0.000 |
| Trihalomethanes | 0.018 | -0.022 | -0.022 | 0.015 | -0.027 | -0.006 | -0.004 | 1.000 | -0.023 | 0.000 |
| Turbidity | -0.047 | -0.029 | 0.026 | 0.000 | -0.013 | 0.022 | -0.011 | -0.023 | 1.000 | 0.000 |
| Potability | 0.097 | 0.074 | 0.046 | 0.076 | 0.148 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 |
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
| ph | Hardness | Solids | Chloramines | Sulfate | Conductivity | Organic_carbon | Trihalomethanes | Turbidity | Potability | |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 5.80 | 193.20 | 19451.77 | 4.15 | 255.98 | 365.48 | 14.92 | 8.58 | 2.18 | 1 |
| 1 | 7.78 | 196.82 | 24789.35 | 6.55 | 331.04 | 372.76 | 12.07 | 14.34 | 5.05 | 0 |
| 2 | 6.95 | 214.17 | 32946.57 | 5.48 | 333.44 | 318.88 | 12.81 | 15.68 | 4.93 | 0 |
| 3 | 8.28 | 227.65 | 17995.41 | 7.49 | 323.38 | 459.87 | 14.36 | 16.29 | 3.69 | 0 |
| 4 | 7.06 | 191.55 | 16473.07 | 8.44 | 367.85 | 462.99 | 12.57 | 17.53 | 3.94 | 1 |
| 5 | 6.39 | 213.02 | 20965.48 | 5.38 | 327.65 | 369.34 | 13.76 | 17.92 | 3.92 | 0 |
| 6 | 6.64 | 215.06 | 16488.05 | 6.64 | 304.76 | 507.13 | 11.98 | 18.02 | 4.71 | 1 |
| 7 | 6.25 | 163.22 | 26408.88 | 6.03 | 429.02 | 509.96 | 23.57 | 19.18 | 5.04 | 1 |
| 8 | 6.26 | 270.47 | 8572.42 | 9.92 | 286.33 | 490.94 | 12.93 | 22.22 | 4.75 | 0 |
| 9 | 7.18 | 201.08 | 25234.43 | 5.22 | 283.74 | 384.01 | 12.43 | 23.14 | 3.67 | 0 |
| ph | Hardness | Solids | Chloramines | Sulfate | Conductivity | Organic_carbon | Trihalomethanes | Turbidity | Potability | |
|---|---|---|---|---|---|---|---|---|---|---|
| 1997 | 8.37 | 179.52 | 22022.63 | 5.22 | 339.49 | 396.70 | 13.70 | 110.43 | 2.79 | 0 |
| 1998 | 4.63 | 208.91 | 29307.13 | 6.13 | 304.03 | 456.21 | 10.82 | 111.12 | 4.75 | 0 |
| 1999 | 7.31 | 193.47 | 19343.15 | 7.66 | 306.69 | 426.56 | 12.84 | 111.60 | 4.05 | 0 |
| 2000 | 9.16 | 186.67 | 15797.03 | 8.15 | 333.81 | 425.75 | 12.18 | 112.62 | 4.53 | 1 |
| 2001 | 6.34 | 164.07 | 26594.35 | 7.38 | 338.43 | 607.10 | 14.93 | 113.05 | 4.58 | 1 |
| 2002 | 8.29 | 151.57 | 14402.73 | 9.05 | 303.08 | 322.52 | 13.65 | 114.03 | 4.27 | 1 |
| 2003 | 8.97 | 195.74 | 9049.68 | 7.47 | 396.45 | 378.53 | 17.76 | 114.21 | 3.98 | 0 |
| 2004 | 5.04 | 190.16 | 29258.74 | 4.99 | 300.48 | 332.36 | 11.06 | 116.16 | 3.53 | 1 |
| 2005 | 6.15 | 197.54 | 39657.27 | 9.90 | 288.16 | 319.43 | 11.59 | 120.03 | 4.60 | 0 |
| 2006 | 7.90 | 210.73 | 15896.37 | 6.91 | 319.89 | 448.67 | 18.17 | 124.00 | 2.85 | 1 |